Digital audio is a representation of sound recorded in, or converted into, digital form. In digital audio, the sound wave of the audio signal is typically encoded as numerical samples in a continuous sequence. For example, in CD audio, samples are taken 44,100 times per second, each with 16-bit sample depth. Digital audio is also the name for the entire technology of sound recording and reproduction using audio signals that have been encoded in digital form. Following significant advances in digital audio technology during the 1970s and 1980s, it gradually replaced analog audio technology in many areas of audio engineering, record production and telecommunications in the 1990s and 2000s.

In a digital audio system, an analog electrical signal representing the sound is converted with an analog-to-digital converter (ADC) into a digital signal, typically using pulse-code modulation (PCM). This digital signal can then be recorded, edited, modified, and copied using computers, audio playback machines, and other digital tools. For playback, a digital-to-analog converter (DAC) performs the reverse process, converting a digital signal back into an analog signal, which is then sent through an audio power amplifier and ultimately to a loudspeaker.

Digital audio systems may include compression, storage, processing, and transmission components. Conversion to a digital format allows convenient manipulation, storage, transmission, and retrieval of an audio signal. Unlike analog audio, in which making copies of a recording results in generation loss and degradation of signal quality, digital audio allows an infinite number of copies to be made without any degradation of signal quality.
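The CD audio figures quoted in the introduction imply a fixed data rate for uncompressed PCM. The following is a minimal sketch of that arithmetic. The function names are illustrative, and the two-channel (stereo) count is an assumption. The result covers the audio payload only, not any error-correction or formatting overhead on the disc.

```python
def pcm_bit_rate(sample_rate_hz: int, bit_depth: int, channels: int) -> int:
    """Bits per second of uncompressed PCM audio."""
    return sample_rate_hz * bit_depth * channels


def pcm_size_bytes(sample_rate_hz: int, bit_depth: int, channels: int, seconds: int) -> int:
    """Bytes needed to hold `seconds` of uncompressed PCM audio."""
    return pcm_bit_rate(sample_rate_hz, bit_depth, channels) * seconds // 8


# CD audio parameters from the text: 44,100 samples per second, 16 bits each.
# Two channels (stereo) is an assumption made for this example.
cd_bit_rate = pcm_bit_rate(44_100, 16, 2)
one_minute_bytes = pcm_size_bytes(44_100, 16, 2, 60)

print(cd_bit_rate)       # 1411200 bits per second
print(one_minute_bytes)  # 10584000 bytes per minute
```

At roughly 10.6 MB per minute, this is the figure that audio data compression formats aim to reduce.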

Overview

Digital audio technologies are used in the recording, manipulation, mass-production, and distribution of sound, including recordings of songs, instrumental pieces, podcasts, sound effects, and other sounds. Modern online music distribution depends on digital recording and data compression. The availability of music as data files, rather than as physical objects, has significantly reduced the costs of distribution and made it easier to share copies. Before digital audio, the music industry distributed and sold music by selling physical copies in the form of records and cassette tapes. With digital audio and online distribution systems such as iTunes, companies sell digital sound files to consumers, which the consumer receives over the Internet. Popular streaming services such as Apple Music, Spotify, or YouTube offer temporary access to the digital file, and are now the most common form of music consumption.

An analog audio system converts physical waveforms of sound into electrical representations of those waveforms by use of a transducer, such as a microphone. The sounds are then stored on an analog medium such as magnetic tape, or transmitted through an analog medium such as a telephone line or radio. The process is reversed for reproduction: the electrical audio signal is amplified and then converted back into physical waveforms via a loudspeaker. Analog audio retains its fundamental wave-like characteristics throughout its storage, transformation, duplication, and amplification. Analog audio signals are susceptible to noise and distortion, due to the innate characteristics of electronic circuits and associated devices.

Disturbances in a digital system do not result in error unless they are so large as to result in a symbol being misinterpreted as another symbol or disturb the sequence of symbols. It is therefore generally possible to have an entirely error-free digital audio system in which no noise or distortion is introduced between conversion to digital format and conversion back to analog.

A digital audio signal may be encoded for correction of any errors that might occur in the storage or transmission of the signal. This technique, known as channel coding, is essential for broadcast or recorded digital systems to maintain bit accuracy. Eight-to-fourteen modulation is the channel code used for the audio compact disc (CD).
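To illustrate how added redundancy lets a receiver correct errors, here is a minimal sketch of the textbook Hamming(7,4) code, which protects four data bits with three parity bits and can correct any single flipped bit. This is an illustration only: it is not the coding scheme used on the compact disc, and the function names are hypothetical.

```python
def hamming74_encode(data):
    """Encode 4 data bits as a 7-bit codeword laid out as p1 p2 d1 p3 d2 d3 d4."""
    d1, d2, d3, d4 = data
    p1 = d1 ^ d2 ^ d4  # covers codeword positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4  # covers codeword positions 2, 3, 6, 7
    p3 = d2 ^ d3 ^ d4  # covers codeword positions 4, 5, 6, 7
    return [p1, p2, d1, p3, d2, d3, d4]


def hamming74_decode(codeword):
    """Correct up to one flipped bit and return the 4 data bits."""
    c = list(codeword)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    # The syndrome, read as a binary number, is the 1-based position of the
    # flipped bit, or 0 if no single-bit error was detected.
    error_position = s1 + 2 * s2 + 4 * s3
    if error_position:
        c[error_position - 1] ^= 1
    return [c[2], c[4], c[5], c[6]]


original = [1, 0, 1, 1]
sent = hamming74_encode(original)
received = list(sent)
received[4] ^= 1  # simulate a disturbance that flips one bit in transit
recovered = hamming74_decode(received)
```

Here `recovered` equals `original` even though one bit was corrupted, which is the sense in which a digital system can remain error-free despite disturbances.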

Conversion process

If an audio signal is analog, a digital audio system starts with an ADC that converts an analog signal to a digital signal. Some audio signals, such as those created by digital synthesis, originate entirely in the digital domain, in which case analog-to-digital conversion does not take place. The ADC runs at a specified sampling rate and converts at a known bit resolution.
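The two ADC parameters, sampling rate and bit resolution, can be sketched in software. The example below is a simplified model, not a description of real converter hardware: it evaluates an ideal sine wave at each sampling instant and rounds it to a signed integer. The function name, the 1 kHz test tone, and the symmetric scaling to ±(2^(N-1) - 1) are assumptions made for illustration.

```python
import math


def sample_and_quantize(freq_hz, sample_rate_hz, bit_depth, num_samples):
    """Return signed-integer PCM samples of a full-scale sine wave."""
    full_scale = 2 ** (bit_depth - 1) - 1  # 32767 for 16-bit samples
    samples = []
    for n in range(num_samples):
        t = n / sample_rate_hz                       # time of the n-th sample
        x = math.sin(2 * math.pi * freq_hz * t)      # ideal analog value in [-1, 1]
        samples.append(round(x * full_scale))        # quantize to nearest integer
    return samples


# A 1 kHz tone sampled at 44.1 kHz with 16-bit resolution.
pcm = sample_and_quantize(1000.0, 44_100, 16, 8)
print(pcm)
```

Each list element is one sample. Raising `sample_rate_hz` yields more samples per second, while raising `bit_depth` yields finer amplitude steps.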

CD audio, for example, has a sampling rate of 44.1 kHz (44,100 samples per second), and has 16-bit resolution for each stereo channel. Analog signals that have not already been band-limited must be passed through an anti-aliasing filter before conversion, to prevent the aliasing distortion that is caused by audio signals with frequencies higher than the Nyquist frequency (half the sampling rate).

A digital audio signal may be stored or transmitted. Digital audio can be stored on a CD, a digital audio player, a hard drive, a USB flash drive, or any other digital data storage device. The digital signal may be altered through digital signal processing, where it may be filtered or have effects applied. Sample-rate conversion, including upsampling and downsampling, may be used to change signals that have been encoded with a different sampling rate to a common sampling rate prior to processing. Audio data compression techniques, such as MP3, Advanced Audio Coding, Ogg Vorbis, or FLAC, are commonly employed to reduce the file size. Digital audio can be carried over digital audio interfaces such as AES3 or MADI. Digital audio can be carried over a network using audio over Ethernet, audio over IP or other streaming media standards and systems.

For playback, digital audio must be converted back to an analog signal with a DAC. According to the Nyquist–Shannon sampling theorem, with some practical and theoretical restrictions, a band-limited version of the original analog signal can be accurately reconstructed from the digital signal.

During conversion, audio data can be embedded with a digital watermark to prevent piracy and unauthorized use. Watermarking is done using a direct-sequence spread-spectrum (DSSS) method. The audio information is then modulated by a pseudo-noise (PN) sequence, then shaped within the frequency domain and put back in the original signal. The strength of the embedding determines the strength of the watermark on the audio data.
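The role of the Nyquist frequency in the conversion process can be demonstrated numerically. The sketch below, with hypothetical function names and an arbitrarily chosen 30 kHz test tone, shows that a tone above half the sampling rate produces exactly the same samples as a lower-frequency tone. This is why such content has to be filtered out before sampling.

```python
import math


def sample_tone(freq_hz, sample_rate_hz, num_samples):
    """Sample an ideal cosine tone at the given rate."""
    return [
        math.cos(2 * math.pi * freq_hz * n / sample_rate_hz)
        for n in range(num_samples)
    ]


def alias_frequency(freq_hz, sample_rate_hz):
    """Frequency in [0, fs/2] that a sampled tone at freq_hz is indistinguishable from."""
    folded = freq_hz % sample_rate_hz
    return min(folded, sample_rate_hz - folded)


fs = 44_100      # sampling rate; the Nyquist frequency is fs / 2 = 22,050 Hz
tone = 30_000    # above the Nyquist frequency, so it will alias
alias = alias_frequency(tone, fs)

above = sample_tone(tone, fs, 64)
below = sample_tone(alias, fs, 64)
max_diff = max(abs(a - b) for a, b in zip(above, below))

print(alias)             # 14100
print(max_diff < 1e-9)   # True: the two sample sequences match
```

Once sampled, the 30 kHz tone cannot be told apart from a 14.1 kHz tone, so no later processing can remove it without also removing legitimate content at that frequency.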

History

Coding

Pulse-code modulation (PCM) was invented by British scientist Alec Reeves in 1937. In 1950, C. Chapin Cutler of Bell Labs filed the patent on differential pulse-code modulation (DPCM), a data compression algorithm. Adaptive DPCM (ADPCM) was introduced by P. Cummiskey, Nikil S. Jayant and James L. Flanagan at Bell Labs in 1973.

Perceptual coding was first used for speech coding compression, with linear predictive coding (LPC). Initial concepts for LPC date back to the work of Fumitada Itakura (Nagoya University) and Shuzo Saito (Nippon Telegraph and Telephone) in 1966. During the 1970s, Bishnu S. Atal and Manfred R. Schroeder at Bell Labs developed a form of LPC called adaptive predictive coding (APC), a perceptual coding algorithm that exploited the masking properties of the human ear, followed in the early 1980s with the code-excited linear prediction (CELP) algorithm.

Discrete cosine transform (DCT) coding, a lossy compression method first proposed by Nasir Ahmed in 1972, provided the basis for the modified discrete cosine transform (MDCT), which was developed by J. P. Princen, A. W. Johnson and A. B. Bradley in 1987 (J. P. Princen, A. W. Johnson and A. B. Bradley: ''Subband/transform coding using filter bank designs based on time domain aliasing cancellation'', IEEE Proc. Intl. Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2161–2164, 1987). The MDCT is the basis for most audio coding standards, such as Dolby Digital (AC-3), MP3 (MPEG Layer III), Advanced Audio Coding (AAC), Windows Media Audio (WMA), and Vorbis (Ogg).

Recording

PCM was used in telecommunications applications long before its first use in commercial broadcast and recording. Commercial digital recording was pioneered in Japan by NHK and Nippon Columbia and their Denon brand, in the 1960s. The first commercial digital recordings were released in 1971. The BBC also began to experiment with digital audio in the 1960s. By the early 1970s, it had developed a 2-channel recorder, and in 1972 it deployed a digital audio transmission system that linked its broadcast center to its remote transmitters.

The first 16-bit PCM recording in the United States was made by Thomas Stockham at the Santa Fe Opera in 1976, on a Soundstream recorder. An improved version of the Soundstream system was used to produce several classical recordings by Telarc in 1978. The 3M digital multitrack recorder in development at the time was based on BBC technology. The first all-digital album recorded on this machine was Ry Cooder's ''Bop till You Drop'' in 1979. British record label Decca began development of its own 2-track digital audio recorders in 1978 and released the first European digital recording in 1979.

Popular professional digital multitrack recorders produced by Sony/Studer (DASH) and Mitsubishi (ProDigi) in the early 1980s helped to bring about digital recording's acceptance by the major record companies. Machines for these formats had their own transports built in as well, using reel-to-reel tape in either 1/4", 1/2", or 1" widths, with the audio data being recorded to the tape using a multi-track stationary tape head. PCM adaptors allowed for stereo digital audio recording on a conventional NTSC or PAL video tape recorder.

The 1982 introduction of the CD popularized digital audio with consumers.

ADAT became available in the early 1990s, which allowed eight-track 44.1 or 48 kHz recording on S-VHS cassettes, and DTRS performed a similar function with Hi8 tapes.

Formats like ProDigi and DASH were referred to as SDAT (Stationary-head Digital Audio Tape) formats, as opposed to formats like the PCM adaptor-based systems and DAT, which were referred to as RDAT (Rotating-head Digital Audio Tape) formats, due to their helical-scan process of recording. Like the DAT cassette, ProDigi and DASH machines accommodated the obligatory 44.1 kHz sampling rate as well as 48 kHz on all machines, and eventually a 96 kHz sampling rate. They overcame the problems that made typical analog recorders unable to meet the bandwidth (frequency range) demands of digital recording by a combination of higher tape speeds, narrower head gaps used in combination with metal-formulation tapes, and the spreading of data across multiple parallel tracks.

Unlike analog systems, modern digital audio workstations and audio interfaces allow as many channels in as many different sampling rates as the computer can effectively run at a single time. Avid Audio and Steinberg released the first digital audio workstation software programs in 1989. Digital audio workstations make multitrack recording and mixing much easier for large projects which would otherwise be difficult with analog equipment.

Telephony

The rapid development and wide adoption of PCM digital telephony were enabled by metal–oxide–semiconductor (MOS) switched capacitor (SC) circuit technology, developed in the early 1970s. This led to the development of PCM codec-filter chips in the late 1970s. The silicon-gate CMOS (complementary MOS) PCM codec-filter chip, developed by David A. Hodges and W.C. Black in 1980, has since been the industry standard for digital telephony. By the 1990s, telecommunication networks such as the public switched telephone network (PSTN) had been largely digitized with VLSI (very large-scale integration) CMOS PCM codec-filters, widely used in electronic switching systems for telephone exchanges, user-end modems and a range of digital transmission applications such as the integrated services digital network (ISDN), cordless telephones and cell phones.

Technologies

Digital audio is used in broadcasting of audio. Standard technologies include Digital audio broadcasting (DAB), Digital Radio Mondiale (DRM), HD Radio and In-band on-channel (IBOC).

Digital audio in recording applications is stored on audio-specific technologies including CD, Digital Audio Tape (DAT), Digital Compact Cassette (DCC) and MiniDisc. Digital audio may be stored in standard audio file formats on a hard disk recorder, Blu-ray or DVD-Audio. Files may be played back on smartphones, computers or MP3 players.

Digital audio resolution is measured in sample depth. Most digital audio formats use a sample depth of 16, 24, or 32 bits.
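What the common sample depths mean in practice can be estimated with the standard textbook formula for the signal-to-quantization-noise ratio of an ideal uniform quantizer driven by a full-scale sine wave, approximately 6.02 N + 1.76 dB for N bits. The sketch below assumes integer (fixed-point) PCM. The function names are illustrative, and 32-bit formats that store floating-point samples do not follow this formula.

```python
import math


def quantization_levels(bit_depth: int) -> int:
    """Number of distinct amplitude values an integer sample can take."""
    return 2 ** bit_depth


def ideal_sqnr_db(bit_depth: int) -> float:
    """Ideal signal-to-quantization-noise ratio in dB for a full-scale sine."""
    return 20 * math.log10(2 ** bit_depth) + 10 * math.log10(1.5)


for bits in (16, 24, 32):
    print(bits, quantization_levels(bits), round(ideal_sqnr_db(bits), 2))
```

Each additional bit doubles the number of levels and adds about 6 dB to the ideal figure. Real converters fall short of these theoretical values.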

Interfaces

For personal computers, USB and IEEE 1394 have provisions to deliver real-time digital audio. USB interfaces have become increasingly popular among independent audio engineers and producers due to their small size and ease of use. In professional architectural or installation applications, many audio over Ethernet protocols and interfaces exist. In broadcasting, a more general audio over IP network technology is favored. In telephony, voice over IP is used as a network interface for digital audio for voice communications.

Several interfaces are engineered to carry digital video and audio together, including HDMI and DisplayPort. Some interfaces offer MIDI support as well as XLR and TRS analog ports.

Digital-audio-specific interfaces include:

* A2DP via Bluetooth
* AC'97 (Audio Codec 1997) interface between integrated circuits on PC motherboards
* ADAT Lightpipe interface
* AES3 interface with XLR connectors, common in professional audio equipment
* AES47 - professional AES3-style digital audio over Asynchronous Transfer Mode networks
* Intel High Definition Audio - modern replacement for AC'97
* I²S (Inter-IC sound) interface between integrated circuits in consumer electronics
* MADI (Multichannel Audio Digital Interface)
* MIDI - low-bandwidth interconnect for carrying instrument data; cannot carry sound but can carry digital sample data in non-real time
* S/PDIF - either over coaxial cable or TOSLINK, common in consumer audio equipment and derived from AES3
* TDIF, TASCAM proprietary format with D-sub cable
